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ABSTRACT 



Statistical methods that can be applied in the 
sequential analysis of multivariate empirical data are provided. 
Twenty-three specific formulas for use under varying conditions are 
discussed. A historical sketch of sequential analysis since World War 
II and a bibliography are included. (AE) 
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MULTIVARIATE SEQUENTIAL ANALYSIS FOR EDUCATION 
Given a basic (simple) hypothesis testing situation with null hypothesis* 
ilternative hypothesis , probability of Type 1 error and probability of Type 11 
error, the sequential analysis (Wald) method involves the construction of the 
likelihood ratio at each stage of sampling and comparison of the ratio to boundary 
values, A and B. If the upper limit is broken with the first sample, the null 
hypothesis is rejected. If the lower limit is broken, the alternative hypothesis 
is rejected (or the null hypothesis i? accepted). If neither limit is broken, a 
second sample is selected and the product of the two likelihood ratios is compared 
to the boundary values, A and B. The procedure is generally continued until one 
of the two limits is broken. Using natural logarithms, the procedure may be repre- 
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panted as in diagram 1. 

Implicit calculations of the upper and lower boundaries involve solution of 
the simultaneous equations found in 2. Some computers can handle thi 3 
problem but more expedient methods are usually taken. Common estimates for A and 
B are found in section 3. Use of these estimates introduce new error probabilities 
into the hypothesis testing situation — it is somewhat comforting to know that the 
sum of the new error probabilities is les 3 than or equal to the sum of the desired 
error probabilities. 

The sequential analysis method for testing a hypothesis necessarily makes the 
sample size, say N, a random variable — N is defined as the size of the sample 
when a boundary is first broken. Calculation of the expected value of N from first 
conditions involves the summation of the series found in 4. Again, direct calcu- 
lations are difficult. Approximations for this mean value exist if the mean 
value of the likelihood ratio is not unity. For the particular case in which 
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population means are being tested, the approximation for the expected value 

of the random variable N, given the particular population mean value Ji is given 

in equation 5. Equation 6 gives this approximation for the special cases, jj 0 and ti a . 

The power of the test is needed for equation 5. From first conditions, the 
power is found by summing the infinite series in equation 7. Often, the power is 
approximated using a two step procedure. Under mild conditions, there exists 
a value h(ji) such that the expected value of the likelihood ratio raised to this 
power is equal to the constant, 1.0. Equation 8 gives two forms of this statement. 

If a non-zero value of h(ti) exists, the power of the sequential test may be estimated 
by the expression found in 9. 

I would now like to summarize the above by means of a common example. Assume 
the random vector with p elements is distributed normally with known covariance 
matrix E. The null hypothesis states the population mean vector is M 0 . The al- 
temative hypothesis states the population mean vector is JJa* The value z for this 
special case is given in section 10 a linear' combination of the p variates. 

For the special case in which p=l, section 11 gives the common expression and 
the expression more often used in quality control work. 

Continuing with the p-dimensional situation , the approximation for expected 
number of vectors which would have to be sampled before a decision is made if the 
null hypothesis is assumed to be true is given in equation 12. A similar form 
exists if the alternative hypothesis is assumed to be true. For other values of 
the population mean vector, this approximation involves the power of the 
sequential test. Section 13 gives the details. 

Perhaps now is the time for the question, "Why, sequential analysis?” The 
procedure was developed by Wald during World War II — its optimality is expressed 
as follows: the expected sample size under either H 0 or H a is usually a fraction 

of the fixed sample size test with comparable error probabilities. In the univariate 
~ase for means, the fraction is often % or less. Empirical work at The Florida 
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State University for a limited number of cases with dimensionality two or three 
show empirically that the fraction in these multivariate cases if often close to 
1/2. Comparable economy in other multivariate cases appears possible. 

All of the above comments have implied that one vector is sampled at each 
stage of testing procedure. It might be desirable to sample groups of vectors at 
each stage of the testing procedure. For the above special case, the only correc- 
tion would be a group si 2 e factor in the denominator of each term in the variance- 
covariance matrix. The expected sample size could be calculated as in equation 14. 
The author has taken another approach to sequential analysis in the multi'-ariate 
ca 3 Q. Using random normal vectors, the Chi-Square or F (possible Beta) distribu- 
tions are used in the likelihood ratio. Sampling is performed with groups of 
vectors. The advantages of this procedure are: 

A. The non-centrality parameter is constant for the F test 
corresponding to the Chi-Square test. Relationships between 
the two tests are possible. 

B. All approximations given in the first part of this papex* apply 
to this testing procedure. 

Certain expected values of random variables are given in section 15, 16, and 17. 

The simple hypothesis situation which was introduced before is summarized 
in section 18. If the first sample of n vectors yields a value z± which satisfies 
the statement of inequality, a second sample of n vectors is taken and the sum 
2 1 + z 2 compared to the boundaries. The complexity of the calculations assume 
the experimenter has a programmer available. The expected number of vectors which 
would have to be sampled if the null hypothesis is assumed to be true is 
approximated by the expression in section 19. 

The comparable test with unknown dispersion matrix is summarized in section 
20. One comment — the sample variance-covariance matrix is calculated for each 
group of measurements. The expected number of vectors which would have to be 
sampled if the null hypothesis is assumed to be true is given, approximately, 
by equation 21. 
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Section 22 gives the basic approach for multivariate analysis of variance. 
Section 23 gives the sequential "T" test in the univariate case. 



Closing Comments i 



1. It is not necessary to insure that the sample 3ize in each group of 
vectors is identical if the coranon approximations are not to be used. 

2. No literature exists for optimal group sizes to be used in the sequential 
analysis. 

3. The procedure can be extended to any standard hypothesis testing 
situation. 
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MULTIVARIATE SEQUENTIAL ANALYSIS FOR EDUCATION 

*«**£* 

1 » f 0 (£) is the density function of the random vector x under the null hypothesis, H 0 . 

f a (x) is the density fijnction of the random vector x under the alternative 
hypothesis, H a . ~ 

, f a<*> 

z i ~ ln 



/° c » y 

(Reject H ) In B < £L < In A 



^ * J 



(Reject H 0 ) 



****** 



2. 0 = P (Rejecting H Q j H 0 ).; yS * P (Rejecting H a \ H a ) 

Cl = P ( z 1 > In A | Ho*) + P ( In B< z j ^ In A, Zj + zg ^ In A | H Q ) + . . . 
/3 - PCz^inB | H a ) + P ( In B< z, < In A, z 1 + z^«£ln B [ H a ) + . . . 



*#***« 

3. 



A ~ 



B 55 



A 



I- Ci 



****** 

4. For a specified population mean vector n , 

E^(N) =1 • £ 1 - P(lnB< z t < lnA|^-)j + 2 *[l - P(lnS< z t < InA, lnB<2j+Z*«M|#}]* 



4* 



****** 



5. Given*/., P (Rejecting H Q = 1 (#-■). 

E L (N) - [l -/£ ln A la B. 

E ^ (2 i> 

****** 

6 * For /^ =/&o> 

E, (N)y d » In A + (1 ~ChlnB 

E ^ 2 i> 

For yU, E^(N) ~ (1^/g) InA ± £ • In B . 

~ *' E zo ( z 0 

Vv dU 
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7 . 1-/3 ^ = P( z t > In + P ( In B< z 1 < In A, z 1 + Z2sUn Ajyao + .. . 

**«*#* 

8. For population mean^ , h (^1 must satisfy 



= E 



- 4 f 






^aC 2 i) \ hG*fV 



:=: i 



l 4 w. 
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9. For a specified population mean vector and non-zero h 



1 ~/3 C&) x • . 

- B 

10. x i N( 4 ,I). 

2 i = ln f aC>Si > - ln f o^i) * £*1 + 2’ 

with C = C/^a ~/^o )* SI 1 and D = E- 

-V ^ '** ~ ““ o 



i.=k 

(Reject H a ) In B< .^feSi+S) •Cln A (Reject H 0 ) 

A.»/ 
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In 



B<Z 



(^a,->^S>Xj 



- a 1 _"] ^ In A . 



^-1 

For ^ M a.>^o » 



-i-k 



****** 



k ‘6 is ^<Jr xr< ^ nA )'v^) + k '(S^p) . 



E„ (N) « d . In A + (1 -a) In B 
** ~ E( Zi ) 

Vu * 



d * ln A + a “ChlnB 
iW D 



with C and D defined in 10 above. 
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<S ST- 

For x = N (>u. * > ) and z: = Cx,- + D, 

E ( e h ^)2i) = exp CCyu. + D + t-fo&Q QS Q fc ) = 1 - 

^ a 

Certain values of the population mean vector yield a non-zero value h C^O* 
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ft,?, «KW*fc» 



h(J&) 

For hC^) * 0 » 



ILQ^±Q2 

cXc* 



1 - B h <^> 
A h 0^ - b^G&O 



E^CN } - fl ~/g C*d MnA (^Vln B 

9 ^ + 5 

with O, D, defined in section 10. 

** *# fc* 

14. n vectors are sampled at each stage. 

f a (x t ), foCXi) are density functions of the sample mean vaetor „ 
z t = ln F aC0i) ~ In f o(*i> 



%[ N I 3 n * 



5 ~/5 C&ol"In A -f/^ G&Q-ln B 



E^.CZi) 



****** 
15. 



x follows the non-central chi-square distribution with p degrees 
of freedom and non-central Lty parameter <? 2 . 



E [xj - p + S 2 
Var (xl — 2p + 4^2 

r i -4?xr f* x Y 

E [_ln xj = In 2 + e *■ ^ y j* J 



a»<* ^ ! 




? M/P + i 



****** 

16. 



with 4^( a ) - P*(a) , the Digamma (or Psi) function. 

P (*> 

y follows the non-central F distribution with m and n degrees 
of freedom and non-central ity parameter £ a . 
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E l>] = 
E D«y| 



£ 2 



■) 

ft) -fh) 



n /l - 

n-2 V 



n 2* 



+ e 
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x follows the non-central Beta distribution with a and b degrees 
of freedom and non-centrality parameter a 2 . 

£ *2 £ . 

E[x] = e~T°a. '£ (Y ) L . 1 

*■** M a + b + i 

& 2 © o 

E fin xj = V(a) " ©“ T H XJ-JL - V Ca + b + i) . 

L'o jj 

**** ** 

18. n vectors are sampled; pk2. 




with 7vl:i ■ n(x. E" 1 Cj£ ~^ a ) 

“Xl:i = n(x. E " 1 (x t -^ 0 ), 



Inversion of a matrix can be avoided if the following relationship is used 

'.*r- 

"V 



* # * * ** 
19. 



IT 1 * * _lS_±jaLL_ - i • 

1EI 

E^fKf £ nT In A + (1 - Oj ) In B 1 

l J 
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n 



vectors are sampled; p 2. 



F 



z i s u il~ 
■ a CP oi p 



/£ 

l 2 



J 



in 



^ a:i 
F o:f 



-f* n 

2 



j 



/ - P ~ F 

In 1 3 + n--p o: 

? -E- F 

V 1 + n-p a 



with P_ . 

a • I 






*'o:i 



(n-p) , n(x. ^ J S: • (x. -v^O 

p(rt-1 ) 

<W»... • "Cil -Aio>§i~' <2t -A>'- 



PC"-’ ) 



**»>* H>4> 
21 ,. 



'iV* r 



ExcCzO - 



L 



cM’ In A + (H -<=>$.) In B 

eZFT” 



fs. - n 

l 2 / 



d'oO , \ . , 

T £-¥-iYf + i ) - 't(i) 



2“ 



4‘ Z ipi4>(n 



S’) 



J 



# * * * * *> 

22. 



Random samples of n vectors are taken from 3 multivariate 
normal populations „ 

x. # = "), -V, ~ N 0*C^ > % ^ and &3 “ 

The hypothesis of equal population mean vectors becomes 



H : 
o 



C 

FV 



yU 



w 



(oli ! •- O i C is a contrast matrix. 

**~ r S " j. j 



ttr t i 

(3) 



A'*; 



Op * (3p> 



c 

AJ 



1 ~JL 9 

T \ -2 1 

/>' « -i; 




is a common selection 












22 * cont. The alternative hypothesis may be: 
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H : C 

a ^ 
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= a = 
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Sequential procedures follow immediately with non-centrality 
parameter in both the known dispension matrix problem and unknown 
d is pens ion matrix problem equal to 




23. Univariate "t". 

n measurements are samples from the normal population. 
H o : A=Ao ; H a : A ~Aa 



F a*i * n C*i -Aa ^ S_1 C*i ~Aa^ * n 

s i 



F o:i * n fo.7#ft> 

Si 2 



.2 



z i = 



n 

2 



In 





i 

+ n-1 
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F o:i 


i 


+ J 


F a:i 




n-1 



It can be shown that the average sampling number in this situation 
is greater than the average sampling number- in the corresponding 
known-variance test. 

*## *** 

****** 




Historical Sketch of Sequential Analysis 



The initial breakthrough in sequential analysis was made by 
Wald (1945, 1947). Cox (1952) and Girshick (1946) extended the 
basic technique to variance tests and exponential family tests. 
Johnson (1953 ^ 1954) worked extensively with analysis of vari- 
ance. Papers on multivariate sequential analysis were written 
by Jackson and Bradley (1961 A, 1961B). Most of the more re- 
cent literature deals with arbitrary stopping rules — see Myers, 
Schneiderman and Armitage (1966). Theoretical work with 
non-central statistics presented in this paper depend heavily 
upon the Mel l in Transform as presented by Epstein (I948't. 
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